remus: proper cleanup on checkpoint failure.
authorShriram Rajagopalan <rshriram@cs.ubc.ca>
Fri, 8 Apr 2011 15:49:04 +0000 (16:49 +0100)
committerShriram Rajagopalan <rshriram@cs.ubc.ca>
Fri, 8 Apr 2011 15:49:04 +0000 (16:49 +0100)
commitd732509e34bc6278ec2d050808f8de15e5872c4d
tree5d3c92ca7311783255ecb4a67b585978534aab62
parentbe1806baf415c623403e4599fa197646b97a5f3d
remus: proper cleanup on checkpoint failure.

While running remus, when an error occurs during checkpointing
(e.g., timeouts on primary, failing to checkpoint network buffer
or disk or even communication failure) the domU is sometimes
left in suspended state on primary. Instead of blindly closing
the checkpoint file handle, attempt to resume the domain before
the close.

Signed-off-by: Shriram Rajagopalan <rshriram@cs.ubc.ca>
Committed-by: Ian Jackson <ian.jackson@eu.citrix.com>
tools/python/xen/lowlevel/checkpoint/checkpoint.c
tools/python/xen/remus/save.py